Measuring Latent Causal Structure
نویسنده
چکیده
Discovering latent representations of the observed world has become increasingly more relevant in the artificial intelligence literature [Hinton and Salakhutdinov, 2006, Bengio and Cun, 2007]. Much of the effort concentrates on building latent variables which can be used in prediction problems, such as classification and regression. A related goal of learning latent structure from data is that of identifying which hidden common causes generate the observations. This becomes relevant in applications that require predicting the effect of policies. As an example, consider the problem of identifying the effects of the “industrialization level” of a country on its “democratization level” across two different time points. Democratization levels and industrialization levels are not directly observable: they are hidden common causes of observable indicators which can be recorded and analyzed. For instance, gross national product (GNP) is an indicator of industrialization level, while expert assessments of freedom of press can be used as indicators of democratization. Extended discussions on the distinction between indicators and the latent variables they measure can be found in the literature of structural equation models [Bollen, 1989] and errorin-variables regression [Carroll et al., 1995]. Causal networks can be used as a language to represent this information. We postulate a graphical encoding of causal relationships among random variables, where vertices in the graph representing random variables and directed edges Vi → Vj represent the notion that Vi is a direct cause of Vj . Formal definitions of direct causation and causal networks are given by Spirtes et al. [2000] and Pearl [2000]. In our setup, we explicitly represent latent variables of interest as vertices in the graph. For example, in Figure 1 we have a network representation for the problem of causation between industrialization and democratization levels. This model makes assumptions about the connections among latent variables themselves: e.g., industrialization causes democratization, and the possibility of
منابع مشابه
Discovering Prerequisite Relationships Among Knowledge Components
Knowing the prerequisite structure among the knowledge components in a domain is crucial for designing instruction and for assessing mastery. Treating KCs as latent variables, we investigate how data on the items that test these skills can be used to discover the prerequisite structure among such skills. Our method assumes that we know or have discovered the Q-matrix (the measurement model) tha...
متن کاملDiscussion of "Learning Equivalence Classes of Acyclic Models with Latent and Selection Variables from Multiple Datasets with Overlapping Variables"
In automated causal discovery, the constraint-based approach seeks to learn an (equivalence) class of causal structures (with possibly latent variables and/or selection variables) that are compatible (according to some assumptions, usually the causal Markov and faithfulness assumptions) with the conditional dependence and independence relations found in data. In the paper under discussion, Till...
متن کاملAnalyzing Individual and Average Causal Effects via Structural Equation Models
Although both individual and average causal effects are defined in Rubin’s approach to causality, in this tradition almost all papers center around learning about the average causal effects. Almost no efforts deal with developing designs and models to learn about individual effects. This paper takes a first step in this direction. In the first and general part, Rubin’s concepts of individual an...
متن کاملDiscovery of Causal Models that Contain Latent Variables Through Bayesian Scoring of Independence Constraints
Discovering causal structure from observational data in the presence of latent variables remains an active research area. Constraint-based causal discovery algorithms are relatively efficient at discovering such causal models from data using independence tests. Typically, however, they derive and output only one such model. In contrast, Bayesian methods can generate and probabilistically score ...
متن کاملExperimental Learning of Causal Models with Latent Variables
This article discusses graphical models that can handle latent variables without explicitly modeling them quantitatively. There exist several paradigms for such problem domains. Two of them are semi-Markovian causal models and maximal ancestral graphs. Applying these techniques to a problem domain consists of several steps, typically: structure learning from observational and experimental data,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1001.1079 شماره
صفحات -
تاریخ انتشار 2010